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Abstract 

The fixed-lag Kalman smoother of Cohn et al. ( 1994), or a version of a more computationally feasible 
approximation of it developed by Todling et al. ( 1998), is under implementation at the Data Assimilation 
Office (DAO) to build a retrospective data assimilation system (RDAS). The initial version of the RDAb 
uses the physical-space statistical analysis system (PSAS; Cohn et al. 1998) and a modification of the 
incremental analysis update (IAU) procedure of Bloom et al. (1996); a follow up version of the RDAS 
will require the use of the adjoint of the DAO general circulation model (developed by Nang and 1. 
M. Navon). The retrospective procedure is designed to produce improved analyses as well as improve 
assimilated fields consequently providing an improved climate representation through data assimilation. 


1 The lag-1 RDAS formulation 

Let y(cr) represent an n-vector state of the general circulation model (GCM) on its sigma vertical 
coordinate system. Moreover, let x(p) represent an rn- vector state of the analysis component of the 
DAS, which "in the case of the DAO system is built in pressure coordinates. This way, we can write 
PSAS analysis increments Sxk\k(p)y at time #&, as 

<Sx*|fc(p) = Xfc|fc(p) - x fc | fc _i(p) = Pfcifc-iHfc l Vk (1) 


where the forecast error covariance matrix P{ |fc _! is defined on a pressure coordinate system. In the 
expression above the r k x m matrix H* converts pressure fields to observation locations, and the r k x n 
matrix T k is the error covariance matrix of the /-/.-residual vector of observation-minus-forecast v k = 
x£ - H A .x fc | fc_i(p), and is given by 

+ ( 2 ) 

where is the r j. x observation error covariance matrix. 

Assume now the existence of a nonlinear operator n that transforms GCM prognostic sigma fields 
on to analysis/first-guess pressure level fields. That is, for a given GCM n-vector background field y(<r) 
an /n-vector first-guess field x(p) can be derived according to 

x(p) = fl[y(<r)] . (- ? ) 


The operator n is nonlinear since it represents more than just simple interpolation procedures: it stands 
for transformation of variables as well, such as converting potential temperature into heights or deriving 
sea level pressure and sea level winds through planetary boundary layer effects. In the DAO system, the 
GCM background state vector is composed of the two zonal and meridional wind components, potential 
temperature, specific humiditv and surface pressure, that is, y{cr) - - y{u.v,0.<i.n)[cr): the analysis 
state vector is composed of the sea-level pressure, sea-level zonal and meridional wind components, 
upper-air zonal and meridional wind components, geopotential height and mixing ratio, that is, x(p) - 

x(p,i. u,i, v,i, u, v, h.q m ){p). . 

It is now convenient to introduce an operator 11+ which takes an analysis state vector x(p) in to a 

model state vector y(cr) according to 

y (cr) = n + [x(p)] . ( 4 ) 
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respectively. The actual implementation of II + is such that it renders minimal the difference ||x(p) 
n[[I + [x(p)]]||. In other words, the error is minimal when transforming an analysis state vector in to a 
model state vector and then back to an analysis state vector. In a nonlinear sense, 11+ is a pseudo- 
inverse of n Analogously, II + must be a pseudo-inverse of II; in practice, numerical inaccuracies and 
the character of the original nonlinearities are such that this can only be achieved to a certain extent. 
With the definitions above, the increment (1) can be converted in to a model increment vector as 

Syk\ki<r) = n + [x fc |,..(p)] -yfc|fc-i(<7) • (') 

which is then be used, after proper rescaling, as the IAU forcing term (see Bloom et al. 1996). 

From Todling and Cohn (1996) the lag-1 retrospective analysis increment is given by 

<*x fc _i|fcji.-i(p) = x k -i\k(p)-*k-i\k-i(p) = P fc-i|fc-i A M-i H * I V Vfc ’ ^ 

which as in the regular (filter) analysis is given in pressure coordinates. The operator A fc fe _i 1S the 
adjoint of the tangent linear model of the GCM and of all transformations involved in taking state 
vectors from the model to the analysis grid and vice-versa; the operator P^n^j is the analysis error 
covariance obtained from 

P2-l|*-l = (I- K *-l (9) 

To derive the proper equation for the operator Ak,k-i we introduce the GCM evolution equation 
through an operator that acts on model fields as follows 

yicifc-i(^) = •^(y*-ii*-i( cr )] ■ 

The nonlinear GCM evolution operator M has the Jacobian operator defined as 
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dM [y] 
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(ii) 


which also acts on model (perturbation) fields. Together with the Jacobians introduced previously, the 
operator A^-i can shown to have the following explicit form 

A ,. fc _r(p) = n fc M*.*-i(<r)n+_ l . U2) 


The need for the adjoint of A fc . fc _i(p) in (8) indicates the requirement for constructing not only the 
adjoint, Ml k _ x , of the model's tangent linear operator M but also the ac U oint bot ' 1 aiK 

n + The final lag-1 retrospective analysis increment on the model variables is derived as 

6yk-i\k.k-i(<r) = yk-i\k(<r) - y k -i\k-i{<r) 

= 11+ (x fc _i|fc_i(p) + <Sx fc _i|fc.fc-i(p)) - yfc-iifc-ii* 7 ) • (!•*) 

This increment can be rescaled properly and used in the IAU procedure to obtain the retrospective IAf. 
assimilation. The motivation for using the retrospective analysis increments <$x fc _ m _i(p) to produce an 
IAU increment that is then used to force the model and perform another GCM integration is similar to 
that of the original IAU procedure, i.e., to produce a consistent dynamically balanced and continuous 
history of the model fields. 



2 The approximate RDAS 


Let us consider the expression for the lug- 1 retrospective increment when 
replacing the adjoint of the dynamics operator j,_| with the identity 
write the lag-1 retrospective analysis increment as 


we take the approximation of 
matrix. In this case, we can 


dX T fc ly A- 

= (I - (Id) 

where the last equality is obtained after substituting the analysis error covariance at i with its explicit 
form, and a tilde is used to indicate this is an approximate quantity. 

In practice, in many three-dimensional variational systems, the forecast error covariance matrix 
p£ o Is constant in time 1 . If we take this matrix to be constant we can write 

Sik-i\k,k - i = (I - Kfc_i|fc_iHfc_i)<5x fc | fc (15) 

where 6x k \ k is the regular (filter) analysis increment. Thus, the expression above provides an extremely 
simple way of calculating the lag-1 retrospective increment given the analysis increment. As a matter of 
fact, the last term in the expression above corresponds to an application of PSAS to the vector Hfc_i<!iXfc|fc. 
which replaces the innovation vector needed when calculating the filter analysis. This quantity represents 
the projection of the current (filter) analysis increments on the observation grid of the previous analysis 

time. 


IAU Assimilation vs Approximate Retrospective IAU Assimilation 
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Figure 1: Schematic representation of IAU (within heavy gray rectangle) and retrospective IAU (within 
light gray rectangle). Dasherl arrows represent the GCM integration: solid arrows represent GCM 
integration forced by IAU (thin arrows) and retrospective IAU increment (thick arrows). 


The expression for retrospective analysis increments can be extern 
the general lag /' result being 

6x k _fu-,k- 1 - (I~ 


led to lags higher than one, with 


<.k- 1 


(U3) 


where Sx k .,\ k k-i = *k-e\k - x k -i\ k - 1- The equation above states that each lag ol the approximate 
retrospective procedure can be obtained from the retrospective increment ol the previously calculated 


'This is not the ease in the DAO's System, however, for .ill practical purposes the forecast error covariance matrix can 
be considered to be constant since it varies very slowly in time. 





retrospective increment, with one additional PSAS application. With the assumption of constant forecast 
error covariances, the same can be shown to be true for the algorithm involving the adjoint operator(s) 
of the previous section. We remark further that. 


• When the forecast, error covariance matrix P' is kept constant, the only time dependence in the 
gain matrix K fc _ i|*— 1 comes from the time dependence of the observing network. 

• It can be shown that, when the forecast errors are indeed independent of the dynamics, expression 
(1(5) corresponds to the optimal solution for the lag-* retrospective increments, in the linear, 
minimum variance sense. 


. Furthermore, when the observing system is constant in time, the lag-1 retrospects analysis a - 
gorithm corresponds to a single step of the bias estimation procedure oi Dee and da Silva (1 JJJ). 
when the bias first-guess is null. In other words, each lag-1 retrospective analysis corresponds to 
a less biased analysis than that of the filter. 

A schematic representation of the approximate lag-1 RDAS is shown in Fig. 1, where the thick solid 
arrows in the lower, light shaded, rectangle indicate the retrospective IAL assimilation trajectory. 


3 Preliminary results 

In this section we briefly show the performance of the RDAS versus that of the regular DAS using an 
identical-twin experiment set up. We construct observations from a control run of the DAS. e con xo 
run is a complete DAS run with the full observing network of rawmsondes, TOVS height retrieva s, 
conventional surface data and Wentz total precipitable water. This experiment serves as the truth, from 
which we construct observations, for the following experiments, at exactly the same locations as the 
original observations. 
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Figure 2: Assimilation versus lag-1 retrospective assimilation. Panels show 10-day mean absolute error 
for heights (left) and zonal winds (right), at 1 mb. Top panels are for assimilation, bottom panels are 
for retrospective assimilation. 

Two experiments are ran for a 10-clay period with the DAS and the RDAS. In Fig. 2 we *ho\\ the 
(10-day) mean absolute error in both these experiments for the the height and zonal wind fie Is a 
mh>. High in the stratosphere the main (only) source of data are the TOYS height retrieva s. e see 





from the figure a considerable improvement in the assimilation when using the retrospective procedure 
(bottom panels). The reduction in errors in the height field is more pronounced than for the winds since 
the only source of wind correction this high up comes from the multivariate aspect of PSAS. Since the 
error covariance formulation in PSAS is nearly geostrophic, with a weak relaxation from geostrophy near 
the equator, wind error reduction is not as significant along the tropus. 
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Figure 3: Absolute mean error in mass stream function for lag-1 retrospective assimilation (left) and 
assimilation (right). 

Another indication of the overall improvement obtained in the assimilation through the RDAS is 
seen in Fig. 3 where the absolute error in the mass-stream function, averaged over the 10-day period 
of the experiments, is shown. We see a slight improvement in the Hadley circulation during this period 
when comparing the RDAS (left panel) against the DAS (right panel). Other climatologically significant 
quantities show similar marginal improvement. 


4 Conclusions 

Preliminary results indicate the lag-1 RDAS to provide improvements over the DAS. Experiments for 
time scales of a month to a season are being conducted to assess more accurately the impact on the 
assimilation climatology. Examination and experimentation of the complete RDAS procedure involving 
the adjoint operators will follow. 


